analysis, consulting, recovery, data integration, addiction, web hosting, GDPdU, domain, data cleaning, data matching, open source, data quality, alcoholism, de-duplication, database, data cleansing, email, deduplication, research, enterprise search